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Abstract 

We consider Achlioptas processes for fc-SAT formulas. We create a semi-random formula with n 
variables and m clauses, where each clause is a choice, made on-line, between two or more uniformly 
random clauses. Our goal is to delay the satisfiability/unsatisfiability transition, keeping the formula 
satisfiable up to densities m/n beyond the satisfiability threshold ctk for random fc-SAT. We show that 
three choices suffice to delay the transition for any k > 3, and that two choices suffice for all 3 < k < 25. 
We also show that two choices suffice to lower the threshold for all k > 3, making the formula unsatisfiable 
at a density below a k . 

1 Introduction 

The Erdos-Renyi model of random graphs undergoes a celebrated phase transition. Specifically, suppose we 
form a random graph G(n, m) with n vertices and m edges by choosing m times uniformly from the Q) 
possible edges. The average degree of this graph is d = 2m/ n. If d < 1, then with high probability in the 
limit n — > oo, G(n,m) consists almost entirely of trees, and the largest component has size O(logn). But if 
d > 1, then with high probability G(n, m) has a giant connected component containing 0(n) vertices. 

In 2001, Dimitris Achlioptas posed the following question. Suppose at each step we are presented with 
two uniformly random edges. We are allowed to choose between them, adding one of them to the graph and 
throwing away the other. We play this game on-line; that is, our choice can depend on the graph up to this 
point, but not on future pairs of edges. Can we delay the appearance of the giant component, ensuring that 
the largest component has size o(n) after m = cn edges for some c > 1/2? 

A positive answer was given by Bohman and Frieze [5], who showed that two choices suffice to delay 
the giant up to c = 0.535. Achlioptas, D'Souza, and Spencer [1] studied a particular rule where we choose 
the edge that minimizes the product of the component sizes of its endpoints, which exhibits a phenomenon 
they call explosive percolation. It is also possible to speed up the appearance of the giant component [131 16] ; 
Spencer and Wormald [27] showed that it can be brought into existence at c = 0.334. 

In analogy with G(n, m), we can consider random fc-SAT formulas Fk(n, m). Specifically, given n variables 
xx, ■ ■ ■ , x n , we create a fc-SAT formula by choosing m clauses uniformly from the 2 fc (^) possibilities. The 
satisfiability threshold conjecture states that there is a critical density aj. = m/n at which Ff.(n, m) undergoes 
a phase transition from satisfiable to unsatisfiable: 



Conjecture 1. For each fc > 2, there is a constant af. such that 

lim Pr[Fk(n,an) is satisfiablc] = < 



1 a < a/c 
a > ak ■ 
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This conjecture has been proved only for k = 2 [5J Q13 [T^], where Q!2 = 1. For the NP-complete case fc > 3, 
there are strong arguments from statistical physics that it is true, and very precise conjectures for the value 
of ctk from calculations using the cavity method [20] [5T] . 

There are rigorous upper and lower bounds on a k assuming it exists. That is, there are known values 
cnj° w , a)? Ish such that F k (n, an) is satisfiable if a < a k ow and unsatisfiable if a > o^ gh . In that case, we write 
a l r <a k < af gh . Specifically, for k = 3 we have [IB [El [IB] 

3.52 < a 3 < 4.4898, (1) 

while the cavity method gives 03 = 4.267. For arbitrary k, the first and second moment methods give [4 

2 fe ln2-0(fc) < a k < 2 fe ln2- (2) 

while the cavity method gives 

Sinclair and Vilenchik [26] asked whether Achlioptas processes can delay the satisfiability/unsatisfiability 
transition for fc-SAT. In other words, suppose at each step we are given a choice of two clauses, each of 
which is uniformly random. We choose one of them and add it to the formula, and our goal is keeping the 
formula satisfiable up to m — an clauses for some a > a k . They showed that two choices are enough to 
delay the 2-SAT transition up to a = 1.0002, and also that two choices can delay the fc-SAT transition for 
k = u(\ogn). Perkins [25] showed that for any k, there is a strategy with t choices, for a constant t, that 
delays the fc-SAT transition. In fact, his analysis shows that three choices suffice for sufficiently large fc, and 
that 7 choices suffice for all fc > 3. 

We improve these results in the following ways. First, we give a simple, nonadaptive strategy that, given 
a choice between three clauses, increases the fc-SAT threshold for all fc > 2. Secondly, we give a two-choice 
strategy that increases the threshold for all 3 < fc < 25, and we conjecture that it works for all large fc as 
well. Finally, we give a simple two-choice strategy that lowers the threshold for all fc. 

2 Three Choices Suffice to Raise the Threshold for all k 

In this section and the next, we show that a constant number of choices suffice to raise the satisfiability 
threshold. Our strategy is simple and nonadaptive. Indeed, it is oblivious to the "topology" of the formula, 
which variables appear together in clauses, and is sensitive only to the signs of the literals. Given a choice 
of t clauses, we choose the one with the largest number of positive literals. 

To show that the resulting fc-SAT formula is satisfiable, we convert it into an ^-SAT formula in the 
following way: for each fc-SAT clause c, we form an £-SAT clause by taking I of the most positive literals in 
c. If the resulting £-SAT formula is satisfiable, then so is the original fc-SAT formula. In Theorem [T] we use 
£ = 2; in Theorems HH we use £ = 3. 

We note that Perkins [3S] used a similar strategy, with £ = 2, to show that a constant number of choices 
suffice for any fc. Here we improve his results, showing that three choices suffice. 

Theorem 1. Three choices suffice to increase the fc-SAT threshold for any fc > 2. 

Proof. As described above, our strategy is simply to take the clause c with the largest number of positive 
literals. We then generate a 2-SAT formula by taking two of the most positive literals from each clause. 
Specifically, if c has two or more positive literals, we form a 2-SAT clause by choosing uniformly from all 
such pairs; if c has exactly one positive literal, we take it and choose uniformly from the fc — 1 others; and 
if all of c's literals are negative, we choose uniformly from all (Jj) pairs. 
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If c is the most-positive of t uniformly random clauses, then the probabilities that the resulting 2-SAT 
clause has 0, 1, or 2 positive literals are 

Po = 2~ fct 

p 1 = (2- k (k + l)) t - Po 

Pi = 1 - Po - Pi ■ (3) 

If there are m = an clauses, this gives a biased random 2-SAT formula with, in expectation, apon, ap\n 1 
and apin clauses of these three types. Note that the variables appearing in each clause are independent and 
uniformly random. 

Recall that a 2-SAT formula on n is equivalent to a directed graph on 2n vertices, corresponding to the 
literals Xi and Xi for each 1 < i < n. Each clause (xi V Xj) is equivalent to a pair of edges, namely the 
implications Xi — > Xj and Xj — > Xi. The formula is satisfiable if and only if no contradictory cycle exists, 
leading from Xi to Xi and back to Xi for some i. 

Unit clause propagation is the process of satisfying a unit clause, i.e. a clause consisting of a single literal, 
and generating the unit clauses implied by it and whatever 2-clauses that variable appears in. For instance, 
if (jti V Xj) is one of the 2-clauses in the formula, satisfying the unit clause (xi) will generate the unit clause 
(xj). In a random formula with apon variables, a positive unit clause (xi) will give rise, on average, to 2apo 
negative unit clauses (xj). Similarly, a positive unit clause will give rise, on average to p\ negative ones, and 
so on. Unit clause propagation is thus described by a two-type branching process, with a matrix aM where 

(4) 

where we treat the number of negative and positive unit clauses in the current generation as a column vector 
and multiply by M on the left. 

Given an initial unit clause u = or , the expected population generated by the entire process is 

(1 + aM + (aM) 2 + •••)• u. 

If aA < 1 where A is the largest eigenvalue of M, this series converges to (1 — aM)^ 1 -u, so in expectation just 
O(l) unit clauses are implied by the initial one. Intuitively, this makes it very unlikely that a contradictory 
loop of implications exists, and therefore suggests that the 2-SAT formula is satisfiable with high probability. 

Indeed, this was proved by Mossel and Sen [24]. They showed that the critical density for random 2-SAT 
formulas of this kind is exactly 

* 1 1 
a = — = t—= ■ 

A Pi + 2^p p 2 

For the unbiased case pi = 1/2 and po ~ P2 = 1/4, this reproduces the 2-SAT threshold a% = 1. Putting in 
our expressions §3§ for po , p\ , and P2 gives 



2fct/2 



2 -a/2(( fc + iy _ !) + 2y /i - (2- k (k + 1))* 

For large k, a* grows as 2 kt / 2 /2. If we set t = 3, then a* exceeds the fc-SAT threshold for all k > 3. In 
particular, for k — 3 we have a* > 4.86, which exceeds the best known upper bound on a^ of 4.4898 10J . 
For k > 4, a* exceeds the first moment upper bound 2 k In 2. □ 

Note that we have shown not just that three choices are enough to generate satisfiable formulas above the 
satisfiability threshold, but that these formulas can be satisfied in polynomial time: just use the polynomial- 
time algorithm for 2-SAT to find a satisfying assignment. For the case k = 2 and t = 2, we have also shown 
that two choices raise the 2-SAT threshold to 1.203, which improves the results of [26 j 125 ) . 

Note also that setting t = 1 in the proof of Theorem [T] shows that the threshold for random fc-SAT 
without any choices grows as a^ = Q(2 k / 2 ). This is far below the second moment lower bound £l(2 k ) [US], 
but the proof is much simpler. 
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3 Two Choices Suffice to Raise the Threshold for 3 < k < 25 

In this section we show that two choices suffice for k up to 25. We do this by analyzing simple linear- 
time algorithms with differential equations. Regrettably, these equations seem too complicated to solve 
analytically; thus we are not able to prove that these results hold for all k > 3, though we conjecture that 
they do. 

We start by showing that a particularly simple algorithm works for 5 < k < 25. We then use slightly 
more sophisticated algorithms to raise the threshold for k — 3 and k = 4. 

Theorem 2. Two choices suffice to increase the fc-SAT threshold for all 5 < k < 25. 

Proof. Our strategy is the same as before: given a choice of t clauses, take the one with the most positive 
literals. We then form a 3-SAT clause by choosing uniformly from among the most-positive triplets of literals. 
Analogous to ([3]), the probability that the resulting clause has 0, 1, 2, or 3 positive literals is 

Po = 2~ kt 

Pl = {2- k (k+l)) t - Po 

p2= ( 2 "(G) +fc+i )J~ pi 

Ps = 1 - Po - Pi - P2 ■ (5) 

Now consider the following algorithm, which we call BUC for Biased Unit Clause. At each step it sets some 
variable x permanently, removing clauses that agree with that setting and hence are satisfied, and shortening 
clauses that disagree with it. 

1. (Forced step) If there are any unit clauses, choose one uniformly and satisfy it. 

2. (Free step) Otherwise, choose x uniformly from all unset variables, and set x true. 

This is identical to the UC algorithm for random fc-SAT studied by Chao and Franco [7J [5] except that, on 
a free step, UC flips a coin to determine the truth value of x. If at any point we have two contradictory 
unit clauses, we simply give up rather than backtracking. Our goal is to use differential equations to show 
that BUC succeeds with positive probability. The existence of a nonuniform threshold [14] . which we claim 
applies to these biased 3-SAT formulas as well, then implies that they are satisfiable with high probability. 

After T of the variables have been set, let Sij(T) denote the number of i-clauses with j positive literals, 
for i = 2,3 and < j < i. Initially we have 5*3^(0) = apj-n and S2,j(0) = 0. Let qo(T) and qi(T) denote the 
probability that the variable on the Tth step is set false or true respectively. Then the expected change in 
Sij at each step is 

3 Q 

for all < j < 3 , E [AS" 3 7 ] = ^- + oil) 

n — T 

forall0<i<2, E [AS,,,] = (3 = j)qiS ^ ± { j± ~ 2S ^ + o(l) . 

The key fact behind these equations is that, at all times throughout the algorithm's progress, the formula 
consisting of the remaining clauses is uniformly random once we condition on the number of clauses of each 
type. In particular, the variables appearing in each clause are uniformly random among the n — T unset 
variables, as is the variable x set on a given step. Thus each 3-clause is either satisfied or shortened with 
probability 3/(n — T); if it has j positive literals and we set x false, then with probability j /(n — T) it 
becomes a 2-clause with j — 1 positive literals; and so on. 
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U BUC 

2 fc ln2 



3456 7 8 9 10 

4.232 9.491 24.306 66.811 190.806 554.106 1610.88 4637.05 
22.181 44.362 88.723 177.446 354.891 709.783 



Table 1: The lower bound ag UC achieved by choosing the clause with the most positive literals, and running 
the Biased Unit Clause algorithm on the 3-SAT formula consisting of one of the the most-positive triplets of 
each clause. For 5 < k < 25, o;b UC exceeds the first-moment upper bound on otk, showing that two choices 
are enough to raise the threshold. 

Rescaling to real- valued variables t = T/n and sy(t) = Sij(tn)/n in the usual way gives the differential 
equations 

forall0<i<3, % ^- (6) 
at 1 — t 

for all < j < 2 , ^ = ( fc + U + l)gb^ + i - 2s 2J 

~ J ~ ' dt l-t w 

with the initial conditions S3j(0) = apj and s% j(0) — 0. Then classic results 29] show that, with high 
probability, Sij(T) = Sij(T/n)n + o(n) for all T, where Sjj(i) is the unique solution to this system of 
differential equations. 

The caveat to this, of course, is that a contradictory pair of unit clauses does not appear. Standard 
arguments show that as long as the branching process of unit clauses stays subcritical throughout the 
algorithm, then the probability that no contradiction occurs, and that the algorithm succeeds in satisfying 
all the clauses, is 0(1). 

Analogous to Q , the unit clauses obey a two-type branching process between negative and positive unit 
clauses, where the expected number of children of each type is within o(l) of the matrix 

1 / s 2 ,i 2s 2 ,o 



m = —— : z ' L iv . (8) 

1 - t V 2s 2,2 S 2 ,l J K ' 

We can group steps together into rounds, where each round consists of a free step followed by a cascade 
of forced steps. Let A denote the largest eigenvector of M. As long as A < 1, the branching process is 
subcritical, and the total expected number 6o, b\ of variables set false or true respectively in a round is 

£) =(i+m+m 2 +. ..)•(;) =(i-m)- i .(; 

where we use the fact that the initial free step in each round sets a variable true. Averaging over many steps, 
but not so many that M changes appreciably, the probability that a variable is set false or true is 

bo h 



9o = 7 — —r ' <?i = 



bo + h ' 60 + 61' 

Similar analyses of multi-type branching processes in algorithms appear in [3j [17] . 

The differential equation (|10l) for S3 j is easy to solve: namely, s^j = apj(l — t) 3 . We integrate the rest 
of the system ([7]) numerically, and use binary search to find the largest a, up to some precision, such that 
max( A(t) < 1. In Tabled] we show the resulting lower bound ag UC for the first few values of k. For k = 3 
and k = 4, ague is below the conjectured values of the threshold [TH], namely 4.267 and 9.931. But for 
5 < k < 25, ague exceeds the first moment upper bound 2 fc In 2. □ 

Asymptotically, «buc seems to grow roughly as 2.5 fc . It is tempting to think that we can prove a lower 
bound on a^uc sufficient to show that two choices suffice for all k > 25 as well, but we have not been able 
to do that. 

The next two theorems use slight improvements to Theorem[2]to raise the threshold for k = 3 and k = 4. 
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Theorem 3. Two choices suffice to increase the 4-SAT threshold. 

Proof. Given two clauses, we again take the one with more positive clauses, but now we apply the BUC algo- 
rithm directly to the resulting 4-SAT formula. Most of the analysis of Theorem [2] goes through unchanged, 
except that the probability that a clause has a given number of positive literals is now 

1 _ 3 3 13 31 

Po ~ 256 ' Pl ~ 32 ' P2 ~ 8 ' P3 ~ 32 ' Pi ~ 256 ' 

The differential equations (O for the density of 2-clauses and the matrix M for the branching process of unit 
clauses (JSJ) are unchanged. The differential equations for 4- and 3-clauses are now 

, . ds4 i Asa i 

for all < 7 < 4 , — pt = -± 

- J - ' dt 1-t 

foraU0<j<3, = ( 4 ~ + (j + l)gb^- + i - 3^ (g) 

dt 1 — t 

and the initial conditions are S4,j(0) = apj and S3j(0) = S2j(0) = 0. 

Integrating this system numerically, we find that M's largest eigenvalue A is less than 1 up to a — 10.709. 
This is less than the naive first moment upper bound on a.±, but it exceeds an improved upper bound of 
10.217 based on counting locally maximal assignments [TT]. □ 

Finally, we use a biased version of the Short Clause (SC) algorithm, which Chvatal and Reed used to prove 
a lower bound on the 3-SAT threshold f9] , to show that two choices can delay the satisfiability transition in 
3-SAT. 

Theorem 4. Two choices suffice to increase the 3-SAT threshold. 

Proof. Once again our strategy is to take the more positive of the two clauses. The probability that a clause 
has a given number of positive literals is 

1 15 33 15 

P0= 64' P1= 64' P2= 64' P3= 64' 

We now analyze the following algorithm, which we call Biased Short Clause (BSC). 

1. (Forced step) If there are any unit clauses, choose one uniformly and satisfy it. 

2. (Free step) Otherwise, if there are any 2-clauses, choose one uniformly. If it has any positive literals, 
choose one uniformly and satisfy it. If both its literals are negative, choose one uniformly and satisfy 
it. 

3. (Really free step) If there are no unit clauses or 2-clauses, choose x uniformly from the unset variables 
and choose x's truth value uniformly. 

This is identical to Short Clause [9] except that, whenever possible, we satisfy the chosen 2-clause by setting 
a variable true. 

During the critical phase of the algorithm, there are O(n) 2-clauses, so we can effectively ignore the 
possibility of a really free step. Let phee denote the probability that a given step is free. The differential 
equations for 3- and 2-clauses are then 

forall0<j<3, % ^ (10) 
dt 1 — t 

for aU0<i<2, = {k - j)qiS ^ ± {J ± 1)g0 ^ +1 = - Pfrcc S -M , (n) 

dt 1-t S 2 ,o + S2,l + S 2 .2 

where the additional term is due to the fact that we choose and satisfy a random 2-clause on every free step. 
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As before, consider a round consisting of a free step followed by a cascade of forced steps, and let bo and 
bi denote the total expected number of variables set false or true during a round. The probability that a 
given step is free is 1 divided by the expected length of the round, 

1 

&0 + &1 

and the probability that a given step sets a variable false or true is q = b /(b + bi) and qy = bi/(b + b\) 
respectively. The matrix M describing the branching process of unit clauses is the same as in BUC. However, 
the initial population of unit clauses in each round is different. Rather than always setting a variable true, 
a free step sets a variable true if the chosen 2-clause has at least one positive literal, and otherwise it sets a 
variable false. Thus 

(i°)= — - — d-M)-^( s i° v 

Integrating this system numerically, we find that M's largest eigenvalue A stays below 1 for all t as long 
as a < 4.581. This exceeds the best known upper bound < 4.4898, completing the proof. □ 

All these results show that two choices are enough to create a formula at a density above at that can be 
satisfied, with probability 0(1), in linear time. 



4 Two Choices Suffice to Lower the Threshold, If There Is One 

We now show that two choices are enough to lower the satisfiability threshold if the threshold exists. If there 
is no threshold, we can still lower it; we explain below what we mean by this tongue-in-cheek statement. 

Theorem 5. Two choices suffice to lower the threshold for fc-SAT for any k, assuming that the threshold 
conjecture holds. 

Proof. Our strategy depends on the topology of the formula, but in a very simple way. Let < a < 1 be a 
constant to be determined. We simply prefer clauses whose variables are all in the set U = {27, x 2 , ■ . ■ , x an } 
to those with one or more variables outside U. 

If we have t choices, the probability that the chosen clause has all its variables in U is 

q = l-(l-a k Y, 

If the subformula consisting of these clauses is unsatisfiable, then so is the entire formula. But this subformula 
is uniformly random in Fk(n',m') where n' = an and E [to'] = qm. By the Chernoff bound, its density is 
arbitrarily close to 

/ m ' , l-(l-a fe )* 
a = — = cry where 7 = . (12) 

n a 

Thus the chosen formula is unsatisfiable w.h.p. if a > at/ J, lowering the threshold by a factor of 7. 

To confirm that there is an a such that 7 > 1, we maximize 7 as a function of a. Specifically, if t = 2 
then 7 is maximized at 

2k- 2^ 1/fe 



2k - 1 



where 

4k(k- 1) (2k- 1 \ 1/fe 1 , . 

This completes the proof. □ 
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We remark that a similar strategy shows that two choices are enough to create a giant component with 
m = cn edges where c = (3/8)^/3/2 = 0.459. 




What if we don't take the threshold conjecture for granted? Theorems [TH3] still "raise the threshold" 
unconditionally, in the sense that two or three choices give formulas that are w.h.p. satisfiable at densities 
where random fc-SAT formulas are w.h.p. unsatisfiable. We can give an analogous result for lowering the 
threshold: 

Theorem 6. For any fc, there is a constant t such that t choices suffice to generate formulas that are w.h.p. 
unsatisfiable at densities where random fc-SAT formulas are w.h.p. satisfiable. For sufficiently large fc, two 
choices suffice. 

Proof. Following the proof of Theorem [5l we just have to ensure that 7 > 7^ where jk — a fc Ish / a fc° w is the 
ratio between the best known upper and lower bounds on the threshold, i.e. the lowest and highest densities 
where random fc-SAT formulas are known to be unsatisfiable or satisfiable respectively. 

Examining (1121) . we see that for any fc and any 7fc there are a,t such that 7 > 7fc. For instance, let 
a = 1/(27^) and let t be large enough so that (1 — a k ) 1 < 1/2. 

For large fc, from ([2]) we have 7fc = 1 + 0(2~ fc fc), where O represents a constant independent of fc. Since 
from (fT3"|) we can achieve 7 = 1 + 0(l/fc 2 ) with two choices, there is some fco such that two choices suffice 
for all k > fco. □ 

For 3-SAT in particular, where the current value of jk is 4.898/3.52 = 1.275, maximizing 7 as a function 
of a shows that 6 choices suffice to lower the threshold unconditionally. 

5 Conclusion 

We have shown that three choices are enough to raise the satisfiability threshold in random fc-SAT, and that 
two are enough to lower it, for any fc. We have also shown that two are enough to raise it for fc < 25. We 
are left with several questions. 

1. Are two choices enough to raise the threshold for any fc? This seems incontrovertible, but we not see 
how to extend our analysis of Biased Unit Clause to arbitrary fc. 

2. Sinclair and Vilenchik [26] point out that if we are allowed to choose off-line, i.e. if we are given all 
pairs of clauses in advance, then with two choices can raise the fc-SAT threshold exactly to the 2fc-SAT 
threshold, since a choice of two fc-SAT clauses is equivalent to a 2fc-SAT clause. Can we do nearly this 
well in the on-line version? Or is there a stricter upper bound on how high we can raise the fc-SAT 
threshold with two on-line choices, say 0(2 ck ) for some c < 2? 

3. Our two-choice strategy for lowering the threshold does so by a factor of 1 + 0(l/fc 2 ). Is there a 
strategy with two choices, or a constant number of choices, that lowers the threshold by a constant 
factor for all fc? 
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